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(54) NOVEL SERINE-THREONINE KINASE GENE 

(57) A novel gene having the consensus sequence 
of a serine-threonine kinase active site has been iso- 
lated by the suppression subtractive hybridization 
methcxJ which comprises preparing a library of genes 
expressed specifically in fetat livers and isolating clones 
from this library at random. This gene presumably par- 
ticipates in cell growth control because it is highly 
expressed, especially in actively growing cells, and 
exhibits a significant homology with a vaccinia virus 
B1 R kinase gene. Thus, it can be utilized as a target for 
developing cell growth inhibitors or antitumor agents. 
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Descripti n 

Technical Field 

5 [0001] The present invention relates to a novel serine-threonine kinase gene. 
Background Art 

[0002] Fetal tissues are comprised of many undifferentiated cells that proliferate actively, highly activated cells, nas- 
70 cent vascular endothelial cells, and so on. Although the activity of these cells in fetal tissues is stringently regulated and 
inhibited as individuals mature, the state of fetal tissues can be considered similar to that of a solid tumor except that 
the activity is regulated. Therefore, some of the genes expressed specifically or more strongly in fetal tissues (fetal 
genes) can be genes involved in the phenomena characteristic of solid tumors such as abnormal growth, immortaliza- 
tion, infiltration, metastasis, and angiogenesis. In addition, some diseases other than tumors are also supposed to arise 
15 because fetal genes, which are repressed in a normal living body, are abnormally activated. Therefore, genes involved 
in various diseases such as tumors can be screened by isolating and analyzing fetal genes. 

[0003] However, there are still few reports on systematic analysis focusing merely on fetal genes from these view- 
points, and at present there is a far from perfect understanding of these gene groups. 

20 Disclosure of the Invention 

[0004] An objective of this invention is to isolate genes expressed specifically in fetal tissues and to screen genes 
related to diseases. 

[0005] The present inventors thought that fetal tissue cells could be a model for solid tumor cells and that genes 
25 involved in diseases such as tumors could be screened by isolating and analyzing fetal genes. Furthermore, the present 
inventors thought it possible to develop a medicine with a novel action mechanism by designing drugs targeting the 
genes. Based on these thoughts, the present inventors have tried to isolate fetal genes. 

[0006] Specifically the present inventors prepared a subtraction library with many genes expressed specifically in 
fetal livers (or more strongly than adult livers) by the suppression subtractive hybridization method, extracted clones 

30 from this library at random, and analyzed their structure. By doing so, the present inventors succeeded in isolating a 
novel gene, VRK1 , having the consensus sequence of a serine-threonine kinase active site. The present inventors also 
performed a data base search based on the amino acid sequence deduced from tiie isolated gene. The present inven- 
tors thus have found tiiis gene product exhibits a significant homology with B1 R kinase, which is presumably involved 
in DNA replication of vaccinia virus. In addrtion, the present inventors found human EST having a very high homology 

35 with this gene in the database and isolated its full-length cDNA. VRK2. Analyzing the expression of the two isolated 
genes in various cells by northern blot analysis showed that these genes are strongly expressed, especially in actively 
growing cells such as human fetal livers, testes, and various tumor cell lines. Furthermore, the present inventors have 
found that the VRK1 protein actually has protein kinase activity. 

[0007] Thus, the present invention relates to novel serine-threonine kinase genus, VRK1 and VRK2. More specifically, 
40 the present irrvention relate to: 

(1) a protein having the amino acid sequence of SEQ ID NO: 2. or a protein having the same amino acid sequence 
where one or more amino adds are added, deleted, or substituted and having serine-threonine kinase activity. 

(2) a protein having the amino acid sequence of SEQ ID NO: 4. or a protein having the same amino acid sequence 
45 where one or more amino acids are added, deleted, or substituted and having serine-threonine kinase activity, 

(3) a protein encoded by a DNA sequence that hybridizes with the DNA sequence of SEQ ID NO: 1 or its comple- 
mentary sequence and having serine-threonine kinase activity. 

(4) a protein encoded by a DNA sequence that hybridizes with the DNA sequence of SEQ ID NO: 3 or its comple- 
mentary sequence and having serine-threonine kinase activity, 

50 (5) a DNA encoding the protein of any one of (1) to (4). 

(6) a vector comprising the DNA of (5), 

(7) a transformant carrying the vector of (6). 

(8) a method of producing the protein of any one of (1) to (4), wherein the method comprises cultivating the trans- 
formant of (7), 

55 (9) an antibody binding to tiie protein of any one of (1) to (4), 

(10) an antisense DNA against the DNA of (5) or part of it. 

(11) a method of saeening compounds having inhibitory actrvity of ssrine-thj-eonine kinase activity of the protein of 
any one of (1) to (4). wherein the method comprises 
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(a) cxDntacting the protein of any one of (1) to (4) with a substrate to be phosphorylated by this protein in the 
presence of a test compound to detect the kinase activity of the protein of any one of (1) to (4). and 

(b) comparing the kinase activity detected in step (a) with that detected in the absence of the test compound 
and selecting a compound that lowers the kinase activity of the protein of any one of (1) to (4). 

5 

[0008] The present invention relates to novel serine-threonine kinases, "VRK1 " and ''VRK2.'' The nucleotide sequence 
of the "VRKr cDNA and the amino acid sequence of the protein are shown in SEQ ID NO: 1 and 2, respectively. In 
addition, the nucleotide sequence of the "VRK2" cDNA and the amino acid sequence of the protein are shown in SEQ 
ID NO: 3 and 4, respectively. "VRKI" cDNA has a significant homology with B1 R kinase, which is presumably involved 

10 In DNA replication of vaccinia virus. The gene is also characterized by its strong expression in actively growing cells 
such as fetal livers, testes, and various tumor cell lines. In addition, overexpression of "VRKl" protein drastically 
increases the growing activity of NIH3T3 cells. These facts imply "VRKI" is involved in the regulation mechanism of cell 
growth. "VRK1" protein has protein kinase activity, which presumably plays an Important roll in the regulation of cell 
growth. •'VRK2" has a high homology with "VRKI ." especially in the serine-threonine kinase site. "VRK2," like "VRK1 

15 has a significant homology with B1 R kinase, and the gene is characterized by its strong expression in actively growing 
cells such as fetal livers, testes, and various tumor cell lines. These facts inply "VRK2" has the same function as that 
of "VRK1/ 

[0009] "VRK1 " and "VRKa" proteins can be prepared as recombinant proteins with recombinant DNA techniques or 
as natural proteins. The recombinant proteins can be prepared, for example, by cultivating cells transformed with the 

20 DMAs encoding these proteins, as will be described later. Natural proteins can be isolated from fetal livers, testes, or 
tumor cell strains such as HeLa S3, in which these proteins are higNy expressed, by a method well-known to one skilled 
in the art. such as affinity chromatography with the antibodies of the present invention as described later. Either poly- 
clonal or monoclonal antibodies can be used. The polyclonal antibodies can be prepared from, for example, serum from 
small animals such as rabbits immunized with these proteins by, for exanrple. ammonium sulfate precipitation, protein 

25 A- or protein G-column chromatography, DEAE ion exchange chromatography, affinity chromatography using a column ^ . 

coupled with these proteins, etc. The monoclonal antibodies can be prepared as follows. First, a small animal such as % 
a mouse is immunized with these proteins. The spleen is extracted from the mouse and dissociated to cells. The result- 
ing cells are fused to mouse myeloma cells using a reagent such as polyethylene glycol, and the clone that produces ^ 
antibodies against these proteins is screened from the fusion cells (hybridoma) thus generated. The hybridoma thus ^< 

30 obtained is then transplanted into a mouse abdominal cavity. Ascites is collected from the mouse and purified by, for 
example, ammonium sulfate precipitation, protein A- or protein G-column chromatography, DEAE ion exchange chro- 
matography, affinity chromatography using a column coupled with "VRKI" or "VRK2" protein, etc. If the antibodies 
obtained are to be used for administering to a human body (for antibody therapy or the like), humanized antibodies or 1 
human antibodies should be used to decrease immunogenicity. An exanple of methods for humanizing antibodies is l 

35 the CDR graft method, in which an antibody gene is cloned from monoclonal antibody-producing cells and its antigenic i 
determinant is transplanted to an existing human antibody Besides, human antibodies can be directly prepared just like t 
usual monoclonal antibodies by immunizing a mouse whose immune system is replaced with a human immune system. 
[001 0] Furthermore, one skilled in the art can prepare not only natural "VRKI " and "VRK2" proteins (SEQ ID NO: 2 
and 4, respectively) but also proteins with substantially the same function as that of the natural proteins, if needed, by 

40 replacing amino acids in the proteins by a well-known method. Besides, mutations of amino acids in proteins can occur 
naturally Thus, mutant proteins with serine-threonine kinase activity that are generated by introducing amino acid sub- 
stitution, deletion, or addition into the natural proteins are also included in the proteins of the present invention. Methods 
for amino acid alteration, for example, a site<Jirected mutagenesis system using PGR (GIBCO-BRL, Gaithersburg, Mar- 
yland), the oligonucleotide-mediated site-directed mutagenesis method (Kramer, W. and Fritz, HJ (1987) Methods in 

45 Enzymol. 154:350-367), and the Kunkel method (Methods Enzymol. 85. 2763-2766 (1988)). are well-known to one 
skilled in the art. Furthermore, usually ten or less, preferably six or less, and more preferably three or less amino acids 
are substituted. The site where substitution, deletion, or addition is introduced is not particularly limited as long as the 
serine-threonine kinase activity is maintained. From the viewpoint of protein activity, the addition, deletion, or substitu- 
tion of amino adds should be performed in a region other than the region corresponding to the consensus sequence of 

so a serine-threonine kinase active site and to the consensus sequence of a protein kinase ATP binding site. Moreover, 
serine-threonine kinase activity of a protein can be detected, for example, by the method described in Exanrple 9, men- 
tioned later. 

[0011] Furthermore, one skilled in the art can usually isolate DNAs having a high homology with the DNA encoding 
"VRKr or "VRK2" protein (SEQ ID NO: 1 or 3. respectively) based an tiie DNA or the part of it using a hybridization 
55 technique (SambrooK J. et al.. Molecular Cloning 2nd ed. 9.47-9.58, Cold Spring Harbor Lab. Press. 1989) and obtain 
proteins having substantially the same function as VRK1 or VRK2 protein (SEQ ID NO: 2 or 4, respectively) from tiie 
DNA. Thus, proteins with serine-threonine kinase activity that are encoded by DNAs hybridizing with DNA encoding 
•"VRKI " or "VRK2" protein are also included in the proteins of tiie present invention. Hybridizing DNAS are isolated from 
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other organisms including, for example, mice, rats, rabbits, bovines. and so on. Tissues such as fetal livers and testes 
are especially suitable for isolating. Thus isolated DMAs encoding proteins having substantially the same function as 
that of *'VRKr or "VRK2'' proteins usually have a high homology with the DNA (SEQ ID NO: 1 or 3) encoding "VRKr 
or "VRK2" protein, respectively. The term "high homology" used herein means at least 40% or more, preferably 60% or 
5 more, and more preferably 80% or more of sequence identity at the amino acid level. From the viewpoint of the protein 
activity, a high homology should be found in the regions corresponding to the consensus sequence of a serine-threo- 
nine kinase active site and to the consensus sequence of a protein kinase ATP binding site. 

[0012] Examples of conditions for hybridization to isolate these DNAs are as follows. After prehybridization at 55*C 
for 30 minutes or longer, hybridization is performed by adding labeled probes and incubating at 37 to 55**C for an hour 

10 or longer using "ExpressHyb Hybridization Solution" (CLONTECH). After that, the resulting hybridized product Is 
washed three times for 20 minutes each at room temperature in 2 x SSC and 0.1% SDS then once at 37°C in 1 x SSC 
and 0.1% SDS. More preferably, after prehybridization at 60°C for 30 minutes or longer, hybridization is performed by 
adding labeled probes and incubating at 60**C for an hour or longer using "ExpressHyb Hybridization Solution" (CLON- 
TECH). Thereafter, the hykxidized product is washed three times for 20 minutes each at room temperature in 2 x SSC 

15 and 0.1% SDS then twice at 50**C in 1 x SSC and 0.1% SDS. Still more preferably, after prehybridization at 68*C for 30 
minutes or longer, hybridization is performed by adding labeled probes and incubating at 68 **C for an hour or longer 
using "ExpressHyb Hybridization Solution" (CLONTECH). Thereafter, the hybridized product is washed three times for 
20 minutes each at room temperature in 2 x SSC and 0.1% SDS then twice at 50**C in 0.1 x SSC and 0.1% SDS. 
[001 3] The present invention also relates to the DNAs encoding the above-described proteins of the present invention. 

20 The DNAs of the present invention include CDNAs, genomic DNAs, and synthetic DNAs as long as they encode the 
proteins of the present invention. The DNAs of the present invention can be used to produce the recombinant proteins. 
Specifically, the recombinant proteins can be prepared by inserting the DNA (for example, the DNA of SEQ ID NO: 1 or 
3) of the present invention into a suitable expression vector, cultivating the transformant obtained by introducing the vec- 
tor into suitable cells, and purifying the expressed proteins. For example, mammalian cells such as COS. CHO. or 

25 NIH3T3 cells; insect cells such as Sf9 cells; yeast cells; and E. coli cells can be used for producing the recombinant 
proteins. Vectors for expressing recombinant proteins in these cells vary depending on the host cells. For example. 
pcDNAS (Invitrogen) or pEF-BOS (Nucleic Acids. Res. 1990. 18(1 7). p5322) is used for mammalian cells: "BAC-to-BAC 
baculovirus expression system" (GIBCO BRL). for insect cells; "Pichia Expression Kit" (Invitrogen). for yeast cells; and 
pGEX-5X-1 (Pharmacia) or "QIAexpress system" (Qiagen). for E. coli cells. Vectors can be introduced into host cells by, 

30 for example, the method using calcium phosphate. DEAE dextran, or cationic liposome DOTAP (Boehringer Man- 
nheim); electroporation; the calcium chloride method; etc. The recombinant proteins can be purified from the obtained 
transformants by the usual methods such as the method described in "The Qiaexpressionist handbooK Qiagen. Hilden. 
Germany" 

[001 4] Furthermore, the DNAs of the present invention can be used for gene therapy of diseases caused by mutations 
35 in genomic DNAs. In gene therapy the DNAs of the present invention are administered to a living body inserted into 
adenovirus vectors (e.g., pAdexLcw), retrovirus vectors (e.g., pZIPneo) and so on. They can be administered by either 
ex vivo methods or in vivo methods. 

[0015] Furthermore, since the proteins of the present invention are presumably involved in the regulation of cell 
growth, antisense DNAs against the DNAs of the present invention or part of them can be used as inhibitors for devel- 
40 oping cell growth or as antitumor agents. The antisense DNAs are administered to a living body directly or in the form 
of the vectors into which they have been inserted. The antisense DNAs can be synthesized by methods well known to 
one skilled in the art. 

[001 6] The present invention also relates to a method of screening compounds having inhibitory activity of serine- 
threonine kinase activity of the proteins of the present invention. This screening method comprises two steps. Rrst, the 
45 protein of the present invention is caused to contact a substrate to be phosphorytated by this protein in the presence of 
a test compound to detect the kinase activity of the protein of the present invention. Second, the kinase activity detected 
in step (a) is compared with that detected in the absence of the test compound, and a compound that lowers the kinase 
activity of the protein of the present invention is selected. 

[0017] Test compounds used for this screening method are not particularly limited and are generally low-molecular- 
50 weight compounds, proteins (including the above-described antibodies of the present invention), peptides, etc. Test 
compounds are either artificially synthesized or natural. Substrates are, for example, casein. IkBa protein, etc. The 
kinase activity of the protein of the present invention can be detected, for example, by adding ATP having radioactively 
labeled phosphate to the reaction system containing the protein of the present invention and the substrate and meas- 
uring the radioactivity of the phosphate attached to the substrate. Specifically, the activity is detected by the method 
55 described in Example 9. The connpounds thus isolated can be used as cell growth inhibitors or antitumor agents. More- 
over, the present inventors learned that "VRKI" protein phosphorylates IkBa protein. IkBa is thought to be rapidly 
degraded when phosphorylated, thereby releasing and activating NF-kB bound thereto. In addition. NF-kB is well 
known as a central transcriptional regulator that causes widespread immune reactions and inflammation reactions. 
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Therefore, compounds that inhibit the kinase activity of the proteins of the present invention can be used as antiphlo- 
gistics and immunosuppressants. 

Brief Descriptinn of the Prawinos 

[0018] 



Figure 1 shows the adapters used for constructing the subtraction library. 

Figure 2A shows the consensus sequence of the active site of serine-threonine kinase. Figure 2B shows the con- 

10 sensussequenceofthe ATP binding site of protein kinase. 

Figure 3 shows the nucleotide sequence of the clone f Is223 and its deduced amino acid sequences. 
Figure 4 shows electrophoretic patterns demonstrating the result of RT-PCR analysis performed to detect expres- 
sions of VRK1 and VRK2 genes in fetal and adult livers. In the figure. "A" and "F" represent Adult liver, and Fetal 
liver. "Low." "Middle." and "High" represent the level of PGR cycles. 

75 Figure 5 compares amino acid sequences of VRK1 and 81 R. 

Figure 6 compares amino add sequences of VRK1 and VRK2. 
Figure 7 compares amino acid sequences of VRK2 and 81 R. 

Figure 8 shows electrophoretic patterns demonstrating the result of northern blot analysis of the expression of 
VRK1 and VRK2 genes in various cells. 
20 Figure 9 shows an electrophoretic pattern demonstrating the result of western blot analysis using anti c-Myc anti- 
body. Cell extracts from COS7 cells transfected with plasmid DNA. pcDNAS (lane 1 ) or pcDNASA/RKI myc (lane 2). 
were examined. 

Figure 10 shows an electrophoretic pattern demonstrating the result of northern blot analysis using the VRK1 
cDNA as a probe. Total RNA samples prepared from NIH3T3 cells transfected with plasmid DNA. pCOS (lane 1) 
25 or pCOS/VRKIw (lane 2), and from a human hepatoma cell line, HepG2 cells (lane 3). were examined. 

Figure 1 1 presents microscopic photographs showing the result of colony assay. A pool of NIH3T3 cells transfected 
with plasmid DNA. pCOS ("pCOS") or pCOSA/RKI w ("pCOS/VRKI w") was examined. 

Figure 1 2 shows an electrophoretic pattern of purified GST fusion proteins (C8B staining). Fusion proteins with wild 

VRK1 protein (lane 1) or with a mutant one (lane 2) were examined. 
30 Figure 1 3 shows electrophoretic patterns demonstrating the result of kinase assay. Added proteins are indicated by 
on the upper portion. Arrows indicate phosphorylated GST-VRK1 ("A." autophosphorylation). phosphorylated 

casein ("C"). phosphorylated GST-lkBa (T). and phosphorylated Ik8a C-termlnal peptide ("P^- 

Figure 14 shows an electrophoretic pattern demonstrating the result of kinase assay. Reactions were performed in 

the presence of various divalent cations at various concentrations as indicated on the right. Arrows indicate phos- 
35 phorylated GST-VRK1 ("A," autophosphorylation). and phosphorylated casein ("C"). 

Figure 15 shows an electrophoretic pattern demonstrating the result of western blot analysis with an antibody 

against a VRK1 peptide using K562 cell extracts. 



Best Mode for Imolementina the Invention 

[0019] The present invention is illustrated below in detail with reference to examples, but is not to be construed as 
being limited thereto. 

Example 1 

Construction of a subtraction library 



[0020] A subtraction library was prepared using the PCR-Select^" cDNA Subtraction kit (CLONTECH) basically 
according to the method described by Luda Diatchenko et al. (Proc. Natl. Acad. Sci. USA, Vol.93. 6025-6030. 1996). 

so [0021 ] First, double-stranded cDNAs were synthesized from polyA* RNA prepared from human fetal and adult livers 
by the standard method using MMLV reverse transcriptase. Next, the respective CDNAs were blunt-ended with T4 DNA 
polymerase, then cleaved by Rsa\, A part of the cDNA originating from fetal liver (tester) was split in two; one of which 
was ligated with the adapter-1 and the other with the adapter-2 (Figure 1). Each aliquot was mixed with an excess 
amount of the adult liver cDNA (driver), denatured by heat, and subjected to the first hybridization at 68°C for 8 hours. 

55 Aliquots were then combined without heat denaturation. mixed further with an excess amount of heat-denatured driver, 
and subjected to the second hybridization at 68*'C for about 16 hours. The mixture was diluted in the dilution buffer, 
incubated at 75°C for 7 minutes to remove the shorter strands of adapters, and used as a template for PCR. By per- 
forming PGR with primers corresponding to the adapters. "PCR primer-1" (SEQ ID NO: 5) and "PCR primer-2" (SEQ 
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ID NO: 6), cDNAs carrying different adapters on their two ends (subtracted cDNAS) were selectively amplified (sup- 
pression PGR). To obtain products with further selectivity, a portion of the amplified products was used as a template 
for PGR with primers "Nested PGR primer-r (SEQ ID NO: 7) and "Nested PGR primer-a" (SEQ ID NO: 8). which locate 
further inside of the primers; "PGR primer-l" (SEQ ID NO: 5); and "PGR primer-2" (SEQ ID NO: 6). The products were 
5 purified using the "QIAquick PGR Purification kit" (QIAGEN), and cloned into the pT7BIue-T vector (Novagen) by the TA 
cloning method to create a subtraction library. 

Example 2 
10 Sequence analysis 

[0022] Plasmid DNA prepared by the alkali SDS method or products of colony PGR were used as a template for 
sequence reaction. Sequence reaction was performed by the cycle-sequencing method using the ABI PRISM™ Dye 
Terminator Gycle Sequencing Ready Reaction Kit With AmplyTaq DNA Polymerase. FS, and the result was analyzed 
15 by the ABI 377 DNA Sequencer. 

[0023] Colony PGR was performed as follows. Golonies carrying recombinant vectors were directly suspended into 
PGR reaction mixtures that contain vector primers. "M13 P4-22 primer" (SEQ ID NO: 9) and "Ml 3 P5-22 primer" (SEQ 
ID NO: 10). After PGR reaction, amplified insert DNA was separated from unreacted primers and nucleotides by gel fil- 
tration or the like, and used as a template for sequencing. 
20 [0024] As a result, the clone f Is223 (261 bp) (later renamed "VRK1 ") was found to be able to encode an amino acid 
sequence (Figure 3) that contains the consensus sequence of the active site of serine-threonine kinase ([Leu, lie, Val. 
Met, Phe. Tyr, Gys]-Xaa-[His, Tyr]-Xaa-Asp-[Leu, lie. Val, Met, Phe. Tyr]-Lys-Xaa-Xaa-Asn-[Leu, lie. Val. Met. Phe, Tyr. 
Gys. Thr]-[Leu, lie, Val, Met, Phe, Tyr, Gys. Trir]-[Leu, lie. Val, Met. Phe, Tyr, Cys. Thr]) (con-esponds to amino acids at 
173-185 of SEQ ID NO: 2) (Rgure 2A). In addition, no gene registered in the database completely matches the nucle- 
us otide sequence of this clone. Thus, the gene is a novel one. 

Example 3 

RT-PGR assay 

30 

[0025] Using polyA* RNA extracted from fetal and adult livers, single-stranded cDNAs were synthesized by the stand- 
ard method with SUPERSGRIPT'" II RNase H' Reverse Transaiptase (GIBGO BRL). Some of the cDNAs were used 
as a template for RT-PGR analysis of f Is223. PGR was performed using TaKaRa Taq (TaKaRa) as Taq polymerase by 
the hot-start method, where the reaction was started by adding TaqStart™ Antibody (GLONTEGH). Primers "FLS223 

35 SI primer" (SEQ ID NO: 1 1) and "FLS223 A1 primer" (SEQ ID NO: 12) were used to amplifyfls223. 

[0026] The G3PDH (glyceraldehyde 3-phosphate dehydrogenase) gene, which is a housekeeping gene equally 
expressed in various tissues and known to be influenced only slightly by various inducers on its expression, was used 
as a control. G3PDH was amplified using the primers "hG3PDH5' primer" (SEQ ID NO: 13) and "hG3PDH3' primer" 
(SEQ ID NO: 14). RT-PGR analysis confirmed that the clone fls223 is strongly expressed in fetal liver, and its expression 

40 was also detected in adult liver (Figure 4). The full-length cDNA was then cloned for more detailed analysis of the gene. 

Example 4 

Gloning by Rapid Amplification of cDNA End (RAGE) 

45 

[0027] The Marathon^" Ready cDNA (GLONTEGH) or cDNA prepared by the Marathon™ cDNA Amplification Kit 
(GLONTEGH) was used as a template for 5* RAGE and 3' RACE (Ghenchik A. et al., CLONTECHniques X, 1. 5-8. 
1995). 

[0028] The primers described above, "FLS223 SI primer" (SEQ ID NO:. 11) and "FLS223 Al primer" (SEQ ID NO: 
so 1 2). were used for 5' RAGE and 3' RAGE of VRKI/f Is223. Using a combination of these primers and a primer AP1 (SEQ 
ID NO: 1 5), con-esponding to the adapter of the template cDNA, the reaction performed basically consisted of a reaction 
at 94°G for 2 minutes; five cycles of reactions at 94**G for 30 seconds and at 68°G 4 minutes: and 30 cycles of reactions 
at 94**G for 30 seconds. 62*0 for 1 minute, and 72^*0 for 3 minutes; and a reaction at 72*C for 10 minutes. TaKaRa Ex 
Taq (TaKaRa) was used for PGR, and the reaction was started by the hot-start method by adding TaqStart™ Antibody 
55 (GLONTEGH). After reaction, detected bands were recovered using the QIAquick Gel Extraction Kit (QiAGEN), and 
subcloned into the pT7Blue-T vector (Novagen). 

[0029] Analysis of the entire nucleotide sequence revealed that the full-length f Is223 cDNA encodes an open reading 
frame composed of 396 amino acids (refer to SEQ ID NO: 1 ). In the former part of the amino acid sequence, there exists 
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a consensus sequence of the ATP binding site of protein kinase ([Leu. lie, Val]-Gly-Xaa-Gly-Xaa-[Phe, Tyr, Trp. Met. Gly, 
Ser. Thr, Asn. His]-[Ser. Gly. Ala]-Xaa-[Leu. He. Val, Cys, Ala, Thr]-Xaa-Xaa-[Gly, Ser. Thr, Ala. Cys. Leu. lie. Val. Met. 
Phe, Tyr]-Xaa(five times or 18 times)-[Leu. lie, Val, Met. Phe. Tyr. Trp, Cys. Ser, Ttir, Ala. Arg]-[Ala. lie. Val. Pro]-[Leu. 
He. Val, Met, Phe, Ala. Gly. Cys. Lys. Arg]-Lys) (corresponds to the amino acids 43-71 described in the SEQ ID NO: 2) 
5 (Figure 2B), and a consensus sequence of the active site of serine-threonine kinase, which is also found in the original 
clone. Thus, the gene product is assumed to be a novel serine-threonine kinase. 

[0030] A homology search of the whole database revealed that the gene shows high homology to the B1 R gene prod- 
uct of the Vaccinia virus (J. Gen. Virol., 70, 3187-3201 . 1989; J. Gen. Virol.. 72. 1349-1376, 1991) (Figure 5). The B1 R 
gene encodes a protein composed of 300 amino acids and is assumed to be a serine-threonine kinase because the 
10 gene contains the consensus sequences analogous to that of the ATP binding site of protein kinase and of the active 
site of serine-threonine kinase. The full-length f Is223 cDNA and the B1 R gene showed relatively high homology over 
the entire region as well as in the kinase domain (Smallest Sum protability in Blast search = 2.7e-78). Therefore, the 
gene is named "Vaccinia virus B1R kinase related Kinase 1" (VRK1). 

[0031 ] B1 R kinase is an early gene whose expression is observed in early stages. It appears several hours after vac- 
75 cinia virus infection and is then repressed. It has been shown that in a mutant strain containing a point mutation on the 
gene, virus replication stops during DNA replication. Thus, it has been hypothesized that B1 R kinase regulates virus 
DNA replication (J. Biol. Chem.. 264. 21458-21461, 1989). 

[0032] VRK1 also exhibits an obvious homology to B1R kinase in the region outside of the serine-threonine kinase 
domain. Thus, VRK1 may participate in the regulation of cellular DNA replication or, more widely, in cell growth control, 
20 as is tiie case for BIR kinase in virus. This notion is supported by the fact tiiat tiie VRK1 genes are more strongly 
expressed in tissues such as fetal liver and the testis, where numerous actively growing cells exist. 
[0033] Furtiiermore. a public clone tiunnan EST - H80169/ which has an extremely high homology to VRK1. was 
found by searching the data base. Using the primers "RK A2 primer" (SEQ ID NO: 16) and "RK SI primer" (SEQ ID NO: 
1 7) for 5* RACE and 3* RACE, the full-length cDNA of tiie gene was cloned as described for VRK1 , and the entire nucle- 
us otide sequence was determined. As a result, it was found that the gene encodes an open reading frame composed of 
508 amino acids (refer to SEQ ID NO: 3). in which tiie consensus sequence of the active site of serine-threonine kinase 
exists. Thus, this gene may also encode a novel serine-threonine kinase. The amino acid sequence has an extremely 
high homology to VRK1. especially near the kinase domain (Figure 6). and a high homology to the vaccinia virus B1 R 
kinase (Figure 7). These suggest a close relationship between this kinase and B1 R kinase. Thus, it was named "Vac- 
30 cinia virus B1 R kinase related Kinase 2" (VRK2). 

[0034] RT-PCR confirmed tiiat VRK2 is also expressed more strongly in fetal liver than in adult liver (Figure 4). The 
primers "RK S2 primer" (SEQ ID NO: 18) and "RK A2 primer" (SEQ ID NO: 16) were used for RT-PCR. 



35 



55 



Example § 

Chromosome mapping 



[0035] Chromosome mapping of the VRK1 and VRK2 genes was performed using the GENEBRIDGE 4 Radiation 
Hybrid Panel (Research Genetics. Inc.) (Nature Genetics. 7, 22-28. 1994). DNA on the panel was used as a template 

40 for PGR. For VRK1. PCR was performed witii a combination of the above primers (TLS223 SI primer" (SEQ ID NO: 
9) and "FLS223 A1 primer" (SEQ ID NO: 12)) by a reaction at 94**C for 5 minutes; five cycles of reactions at 94*^0 for 
30 seconds and at 72**C for 2 minutes; and 30 cycles of reactions at 94**C for 30 seconds and at 68**C for 2 minutes; 
and a reaction at 72'C for 3 minutes. For VRK2, PCR was performed witii a combination of primers "VRK2 A primer" 
(SEQ ID NO: 19) and "VRK2 B primer" (SEQ ID NO: 20) by a reaction at 94°C for 3 minutes; 30 cycles of reactions at 

45 94'*C for 30 seconds. 60°C for 1 minute, and 72^*0 for 2 minutes; and a reaction at 72**C for 5 minutes. The resulting 
pattern was analyzed on a database on Internet (http://www-genome.wi.mit, edu/cgl-birVcontig/rhmapper.pl). and maps 
were obtained. 

[0036] The VRK1 gene was tiius mapped to the position between tiie STS markers "D14S265" and •'APM063XE7" 
on chromosome 14. Similarly, the VRK2 gene was mapped to the position between the STS markers 
so "CHLC.GAT^23H01 " and "D2S357" on chromosome 2. 

Example 6 

Northern blot analysis 



[0037] The expressions of VRK1 and VRK2 mRNA in various human normal tissues and tijmor cell lines were ana- 
lyzed by northern blotting (Rgure 8). 

[0038] The 5' terminal fragment of tiie VRK1 cDNA (upsti-eam region of tiie Hindi II site at nucleotide residue 546) or 
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that of the VRK2 cDNA (upstream region of the EcoRI site at position 426) was labeled with [a-^2p]dCTP by the random 
primer method using Ready-to-Go DNA labeling beads (Pharmacia), and used as a probe. Hybridization was per- 
formed at 68°C in ExpressHyb Hybridization Solution (CLONTECH) using Multiple Tissue Northern (MTN) Blot - 
Human. Human II. Human Fetal II. and Human Cell line (CLONTECH) according to the method recommended by the 
5 manufacturer. Rnal wash was done at SO^'C in 0.1 x SSC. 0.1% SDS, and the image on the filter processed through 
hybridization was analyzed with the BAS-2000II bioimaging analyzer (Fuji Photo Film). 

[00391 In Figure 8. tumor cell lines used are malignant melanoma cells "G361 lung carcinoma cells "A549." colorec- 
tal adenocarcinoma cells ''SW480,'' Burkitt's lymphoma cells "Raji." acute lymphoblastic leukemia cells (T cell) "MOLT- 
4," chronic myelogenous leukemia cells "K-562," uterocervical carcinoma cells "HeLaSS." and promyelocytic leukemia 
10 cells "HLeO". 

[0040] The results revealed that VRK1 was expressed relatively highly in fetal tissues, and extremely highly in fetal 
liver. VRK1 was expressed weakly in almost all adult tissues, but it was expressed strongly in the testis and the thymus. 
In tumor cell lines, a very strong expression of VRK1 was observed in six out of eight cell lines. 
[0041] The expression pattern at VRK2 was basically similar to that of VRK1 ; VRK2 was expressed strongly in fetal 
15 liver and the testis. Similarly, it was expressed strongly In tumor cell lines. However. VRK2 was not expressed in MOLT- 
4 cells, in contrast to the pattern of VRK1 mRNA. 

Example 7 

20 Constructing expression plasmid DNAs 

[0042] cDNA containing the entire coding region of VRK1 or VRK2 was amplified by PCR using a combination of prim- 
ers. VRK1 SI primer (SEQ ID NO: 21) and VRK1 A1 primer (SEQ ID NO: 22). or VRK2 Si primer (SEQ ID NO: 23) and 
VRK2 A1 primer (SEQ ID NO: 24) from cDNA synthesized from polyA* RNA extracted from human fetal liver The 

25 amplified product was cleaved at the A/of I site that is attached to the end of the primers and purified by agarose gel elec- 
trophoresis to obtain DNA fragments of the con-ect size. These were subcloned into the pCOS vector, which was pre- 
treated with Not\ and dephosphorylated on its ends with alkaline-phosphatase/ClAP (TaKaRa). This vector contains an 
EF1 a promoter and enables expressing cloned cDNA strongly in a broad range of mammalian cell lines. By sequencing 
the thusK)btained subclones, clones (pCOSA/RKI w, pCOSA/RK2w) without mutation such as PCR en-or were selected 

30 and used for expression as described below and for further construction of expression plasmid DNA. 

[0043] Plasmids for expressing proteins in which the anti c-Myc antibody epitope sequence (SEQ ID NO: 25) is 
attached to the C-terminus were constructed as follows. Using about 50 nanograms of the plasmid DNA and with 
pCOSA/RKIw or pCOSA/RK2w as a temptate, PCR was performed with a combination of primers. These included 
VRK1 MYC1 primer (SEQ ID NO: 26) and VRK1 MYC2 primer (SEQ ID NO: 27). or VRK2 MYC1 primer (SEQ ID NO: 

35 28) and VRK2 MYC2 primer (SEQ ID NO: 29), and cDNA with the anti c-Myc antibody epitope attached to the C-termi- 
nus of the coding sequence was amplified. KOD DNA polymerase (TOYOBO) was used as the DNA polymerase. The 
amplified product was cleaved at the SamHI site that is attached to the end of the primers and purified by agarose gel 
electrophoresis to obtain DNA fragments of the correct size. These were subcloned into the pcDNAS vector (Invitro- 
gen). which was digested with SamHI and dephosphorylated on its ends with alkaline-phosphatase/ClAP (TaKaRa). By 

40 sequencing the thus-obtained subclones, clones (pcDNASA/RKImyc. pcDNA3/VRK2myc) without mutation such as 
PCR error were selected, and used for later experiments. 

[0044] Expression plasmid DNAs for glutathione-S-transferase (GST) fusion proteins in E. coli were constructed as 
follows. Using the plasmid DNA pCOSA/RK1 w or pCOSA^RK2w as a template, the coding region was amplified by PCR 
with a combination of primers. VRK1 H3 primer (SEQ ID NO: 30) and VRK1 H4 primer (SEQ ID NO: 31). or VRK2 H3 

45 primer (SEQ ID NO: 32) and VRK2 H4 primer (SEQ ID NO: 33). The amplified product was cleaved at the BamH\ site 
that is attached to the end of the primers purified by agarose gel electrophoresis to obtain DNA fragments of the correct 
size. These were then subcloned into the pGEX-5X-1 vector (Pharmacia), which was digested with BamHI and dephos- 
phorylated on its ends with alkaline-phosphatase/ClAP (TaKaRa). By sequencing the thus-obtained subclones, clones 
(pGEXA/RKI w. pGEXA/RK2w) without mutation such as PCR error were selected and used for later experiments. 

50 [0045] A clone with a mutation introduced to the predicted ATP binding site within the kinase catalytic domain (Lys at 
position 71 in the amino add sequence of SEQ ID NO: 2 is replaced by Trp) was constructed using the Chameleon^" 
Double-Stranded Site-Directed Mutagenesis Kit (STRATAGENE) as follows. About one microgram of the pGEXA/RKI w 
plasmid DNA was mixed with primers VRK1 KW1 primer (SEQ ID NO: 34) and a selection primer. Selecti primer (SEQ 
ID NO: 35). and heat denatured by boiling for 5 minutes. Plasmid DNA and both primers containing mutation were then 

55 annealed by incubating at room temperature for 30 minutes. Next, new DNA strands were synthesized from primers by 
adding sul>strate nucleotides. DNA polymerase, etc. These were treated with Pst\ to digest wild plasmid DNA. and 
introduced into XLmutS conpetent cells. After overnight liquid culture, plasmid DNA was extracted then treated with 
PstI to digest contaminating wild plasmid DNA. It was then reintroduced into the competent cells. By sequencing sev- 
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eral single isolated colonies, a clone (pGEXA^RKI K71 W) with an introduced mutation was selected. 

Example 8 

5 Expression in mammalian cell lines 

[0046] About 1 0 micrograms of plasmid DNA. pcDNA3A/RK1 myc or pcDNA3. was introduced (transfected) into COS7 
cells using SuperFect (QIAGEN). Specifically, about 10® C0S7 cells were plated in a 10-cm dish, and cultured over- 
night. A mixture of 10 micrograms of plasmid DNA and 60 microliters of SuperFect was then added to the culture, and 

10 the culture was incubated for about 3 hours. Thereafter, the culture medium was replaced with fresh medium. The cells 
were then cultured for two more days and collected by detaching in a trypsin-EDTA solution. Cells were washed once 
in PBS, disrupted in RlPA buffer (1% NP-40, 10 mM Tris-HCI (pH 7.2), 0.1% sodium deoxychorate. 0.1% SDS. 0.15 M 
sodium chloride, 1 mM EDTA. 10 micrograms/nrd aprotinin, 1 mM PMSF), and cell extracts were obtained by centrifu- 
gation. The cell extracts were separated by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and subjected to 

15 western blot analysis using anti c-Myc antibody (SANTA CRUZ). A band of atxjut 50 kDa was specifically observed in 
cells transfected with the pcDNA3A/RK1myc plasmid DNA. indicating that VRKImyc protein is expressed (Figure 9). 
[0047] Next, about 7.5 micrograms of plasmid DNA, pCOSA/RKI w or pCOS. was transfected into NIH3T3 cells by 
the method using cationic phospholipid DOTAP (Boehringer Mannheim). After transfection, transformants were 
selected by adding G418 (GIBCO-BRL) to the culture medium to a final concentration of 500 microg rams/ml. Total RNA 

20 was prepared from each pool of transformants by the method using ISOGEN (Wajo Junyaku). The total RNA was then 
subjected to northern blot analysis using the VRK1 cDNA as a probe. The results confirmed that VRK1 mRNA was 
expressed in a pool of cells obtained by transfection with the pCOSA/RKIw plasmid DNA (Figure 10). These pools of 
cells were examined for the ability to form colonies on soft agar (colony assay). To this end, 2 x lO'* cells were sus- 
pended in Dulbecco's Modified Eagle Medium (DMEM) containing 10% fetal bovine serum and 0.4% thawed Sea- 

25 Plaque agarose (TaKaRa), and overlaid on a bottom agarose which was made of 0.53% SeaPlaque agarose. 1 0% fetal 
bovine serum, and DMEM. After two-week culturing, the pool of cells obtained by transfection with the pCOSA/RK1w 
plasmid DNA formed a number of colonies that were obviously larger than those formed in the pool of cells obtained by 
transfection with the pCOS plasmid DNA. This suggests that overexpression of VRK1 confers abnormal growth activi- 
ties on cells (Figure 11). 

30 

Example 9 

Expression of VRK1 protein in E.co// and kinase assay 

35 [0048] Both wild VRK1 protein and mutant VRK1 protein were expressed in Exoli as a fusion protein with GST protein 
and purified. The E.coli DH5a strain cells carrying the above-described plasmid DNA. pGEXA/RKIW. or 
pGEXA/RK1K71W were cultured overnight at 37°C in 10 ml 2xYT medium. Some of the culture was diluted '100 -fold 
with fresh 2xYT medium and cultured at 37'*C until the OD value at 600 nm reached 0.6. IPTG (isopropyl-p-D(-)-thioga- 
lactopyranoside) was then added to the culture to a final concentration of 0.1 mM. and the culture was incubated further 

40 for several hours. The E. coli cells were collected by centrifugation, resuspended in PBS containing 1% Triton X-100 
and 1% Tween 20, and subsequently disrupted by sonication to solubilize proteins. From the solubilized samples, wild 
VRK1 protein and mutant VRK1 . which were expressed as a fusion protein with GST, were purified by affinity chroma- 
tography using glutathione Sepharose4B (Pharmacia). These proteins were subjected to SDS-PAGE and stained with 
Coomassie Brilliant Blue (CBB) to confirm their purity (Figure 12). GST protein and GST-lkBa protein were prepared in 

45 the same manner. 

[0049] Kinase assay was perfbmned on a total of 50 microliters of a reaction mixture containing 0.2 micrograms of wild 
or mutant VRK1 protein. 50 mM Tris-HCI (pH 7.2). 1 mM dithicthreitol (DTT), 2 mM or 10 mM of divalent cation (Mg. Mn, 
Zn. Ca). a maximum of 5 micrograms of substrate protein, and 1 microliter of [y-^^Pj-ATP (3000 Ci/mM. 10 mCi/ml 
[Amersham]). In some experiments, another buffer system containing 40 mM Hepes (pH 7.4), 1 mM DTT, and 2.5 mM 
so EGTA was used. 

[0050] Specifically the reaction was first performed in the presence of 10 mM Mg at 37°C for 30 minutes using, as a 
protein substrate, histone (Nakalai), casein (Sigma), myelin basic proteinyMBP (Sigma). GST. GST-lkBa, or IkBa C-ter- 
minal peptide (SEQ ID NO: 36). The reaction mixture was subjected to SDS-PAGE. and the radioactivity of phosphor- 
ylated proteins was analyzed with a bAS2000ll bioimaging analyzer (Fuji Photo Film). The result indicates that wild 
55 VRK1 protein phosphorylates casein and GST-lkBa (Figure 13). In contrast, no phosphorylation was observed in reac- 
tions with mutant VRK1 carrying mutation in the predicted ATP binding site. This indicates that VRK1 is a protein kinase 
that contains a typical catalytic domain. In addition. GST protein was not phosphorylated by VRK1 . suggesting that the 
phosphorylation of GST-lkBa protein by VRK1 occurs within IkBa protein but not in GST moiety. 
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[0051] IkBa is said to negatively regulate the function of transcription factor NF-kB by fonning a complex with it. In 
addition, it is widely accepted that IkBa is Inactivated by se!f-phosphorylation, immediately thereafter undergoes prote- 
olysis, thereby liberating and activating NF-kB. NF-kB is supposed to be a central transcriptional regulator that induces 
a broad range of immune reactians and inflammatory reactions. Therefore, a kinase that phosphorylates IkBa is impor- 

5 tant as a target molecule of anti-inflammatory drugs. Since VRK1 strongly phosphorylates IkBa in vitro, VRK1 probably 
participates in the activation of NF-kB by phosphorylating IkBa in vivo as well. Therefore, rt is possible to anticipate anti- 
inflammatory effects or immunosuppressive effects by inhibiting VRK1 kinase activity, or by reducing its protein amount. 
[0052] Next, the requirement for divalent cations in phosphorylation by VRK1 was examined (Figure 14). In the pres- 
ence of various divalent cations (Mg. Mn, Zn. and Ca) at a final concentration of 2 mM or 10 mM. kinase reactions were 

10 performed using casein as a substrate protein. The result showed that VRK1 exhibits phosphorylation activity in the 
presence of all divalent cations except for Zn. However, the levels of activity were different; VRK1 exhibited especially 
strong activity in the presence of Mn. 

Example 10 

15 

Preparation of an antibody against VRK1 protein 

[0053] A peptide (SEQ ID NO: 37) con-esponding to the C-terminal sequence of the deduced VRK1 amino acid 
sequence was synthesized (Sawady Technology) ar»d conjugated to Keyhole limpet hemocyanin (KLH) at its amino-ter- 

20 minal cysteine mediated by m-maleimidobenzoyi-N-hydroxy-succinimide ester (MBS). This was used as an antigen to 
immunize rabbits, and antisera were obtained. Antibodies that specifically react with the peptide were purified from the 
antisera by affinity chromatography using Cellulofine (Seikagaku Corporation) conjugated with the peptide. Western 
blot analysis using extracts of K562 cells, which were confirmed by northern blot analysis to strongly express VRK1 . 
detected a single band with a molecular weight of approximately 50 kDa, indicating that VRKI protein is specifically rec- 

25 ogntzed by the antibody (Figure 15). 

Industrial Apolicabilitv 

[0054] The serine-threonine kinase genes isolated by the present inventors show significant homology to a vaccinia 
30 virus gene that is thought to be involved in DNA replication and are strongly expressed in actively growing cells. Fur- 
thermore, overexpression of proteins encoded by the genes remarkably enhances cell proliferation activity. Thus, the 
isolated serine-threonine kinase genes are assumed to participate in the regulation of cell growth. Therefore, it is pos- 
sible to develop cell growth inhibitors or antitumor agents based on a novel mechanism by screening drugs targeted on 
the genes (such as antisense DNA). or drugs which can regulate either the expression of the genes or the activity of 
35 the proteins encoded by the genes of the present invention. 



40 



45 



so 



55 



BNSDOCID: <EP 0960938A1J_> 



10 



EP 0 960 938 A1 



Sequence Li sting 

{ 1 > Name of Applicant : Chugai Research Institute for Molecular Medicine, 
Inc. 

(2) Title of the Invention: Novel Serine-threonine Kinase Gene 

(3) Reference Number: CI-805PCT 

(4) Application Number: 

(5) Filing Date: 

{ 6 ) Country where the priority application was filed and the application 
number of the priority application: Japan, No. Hei 8-357864 

(7) Priority dates December 26, 1996 

(8) Number of Sequences: 37 

SEQ ID NO: 1 

SEQUBNCS I<ENGTH: 1662 

SEffUZNCB TYPE: nucleic acid 

STRAKDEDNESSS: double 

tOPOliOGY; linear 

MOLECULE TYPE: cDMA to sHOaA 

FEATURE: 

NAKE/KETs CDS 

LOCATTOK: 76.. 1266 
SEQUENCE DE5CKIPTIOH: 

CCGAGTTACQ AGTCGGCGAA AGCG6CGOGA AOTTCGHACT GGGCAGAACG CCSACGGGTCT 60 
GCGGCTTAGG TGAAA ATG OCT CGT GTA AAA GCA GC7 CAA GCT GGA AGA CA6 111 

Mat Pro Arg val Lys Ala Ala Gin Ala Gly Arq Gin 
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15 10 
AGC TCT GCA AAG AGA CAT CTT GCA GAA CAA TT7 GCA GTT GGA GAG ATA 159 
Ser B&r Ala Lye Arg His Leu Ala Glu Gin Pbe Ala Val Gly Glu He 

15 20 25 

ATA ACT GAC ATG GCA AAA AAG CAA TGG AAA GTA GGA TTA CCC ATT CGC 207 
He Thr Asp Mat Ala Lya Ly* Glu Trp l.y« Val Gly Leu Pro lie Gly 
15 30 35 40 

CAA GGA GCC TTT CGC TCT ATA TAT CTT GCT CAT ATG AAT TCT TCA GAG 255 
Cln oLy Gly Phe CXy Cys xie Tyr Leu Ala Asp Met Asn Ser Ser Glu 
45 50 55 60 

TCA GTT GGC ACT CAT CCA CCT TGT GTT GTA AAA GTG GAA CCC ACT GAC 303 
Smr Val Cly Ser Acp Ala Pro Cyo Val Val Ly9 Val Glu Pro Sarr A9p 

€5 70 75 

AAT GGA CCT CTT TTT ACT GAA TTA AAG TTC TAC CAA CGA CCT GCA AAA 351 
Acn Oly Pro Leu Phe Thr GXu X^eu Lya Phe Tyr Gin Arg Ala Ala Lya 

80 85 90 

CCA GAG CAA ATT CAG AAJL TGG ATT CCT ACC CGT AAG CTC AAG TAC CTC 399 
pro Clu Gin He cln Ly» Trp He Arg Thr Arg Lya L«u Lya Tyr Leu 

95 100 105 

GCT GTT CCT AAG TAT TGC GGG TCT GGT CTA CAT GAC AAA AAT GGA AAA 447 
Gly Val Pro Lya Tyr Trp Gly Ser Gly Leu Bis Asp Lya Asn Cly Lye 

110 115 120 

ACT TAC ACC TTT ATG ATA ATG GAT CGC TTT CCC ACT GAC CTT CAG AAA 495 
Ser Tyr Arg phe Met He Met Asp Arg Phe Gly Ser Aap Leu Gin Lys 
125 130 135 140 
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ATA TAT GAA CCA AAT CCC AAA AGG TTT TCT CGG AAA ACT OTC TTG CAG 543 
lie Tyr ciu JUa Aan Ser Arg Lys~Tlir Val I-ea Gin 

145 150 155 

CTA AGC TTA AGA ATT CT6 GAT ATT CTG GAA TAT ATT CAC GAG CAT GAG 591 
hmjx Ser Lau Arg He I.eu Asp lie l^u Glu Tyr He His Glu Hi» Glu 
160 165 170 

15 TAT CTG CAT GGA CAT ATC AAG GCC TCA AAT CTT CTT CTG AAC TAC AAG 639 

Tyr Val Hie Gly Asp He X«y8 Ala Ser Asn Leu Leu Leu Asn Tyx Lys 

175 180 185 

AAT CCT GAC CAG CTG TAG TTG CTA GAT TAT GQC CTT GCT TAT CGG TAC 687 
Aea Pro Asp Gin Val Tyr Leu Val Asp Tyr Gly Leu Ala Tyr Atg Tyr 

19C 195 200 

TGC CCA GAA GGA GTT CAT AAA GAA TAC AAA GAA CAC CCC AAA AGA TGT 735 
Cys Pro Glu Gly Val His Lys Glu Tyr Lys Glu Asp Pro Lys Arg Cys 
205 210 215 220 

CAC GAT GGC ACT ATT GAA TTC ACG AGC ATC CAT OCA CAC AAT GGT GTG 783 
»±» Asp Gly Thr He Glu Phe Thr Ser He Asp Ala His Asn Gly Val 

225 230 235 

CCC CCA TCA AGA CGT CGT GAT TTG GAA ATA CTT GGT TAT TGC ATG ATC 831 
Ala Pro Ser Arg Arg Gly Asp Leu Glu He Leu Gly Tyr Cys Mer He 

240 245 250 

CAA TGG CTT ACT GGC CAT CTT CCT TGG GAG GAT AAT TTG AAA CAT CCT 879 
61b Trp Leu Thr Gly Kis Leu Pro Trp Glu Asp Asn Leu Lys Asp Pro 

255 260 265 

AAA TAT GTT AfiA GAT TCC AAA ATT AGA TAC AGA CAA AAT ATT CCA AGT 927 

55 
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15 



20 



Uya Tyr Val Ar? Asp Ser Lys He Arg Tyr Arg Glu Asn He Ale Sor 

270 275 2B0 

TTG ATG GAC AAA TGT TTT CCT OAG AAA AAC AAA CCA QQT CSAA ATT GCC 975 
hmM Met: Asp Lys Cys Phe Pro Clu Lya Asn Lys Pro Gly Glu He Ala 
265 290 295 300 

AAA TAC ATG GAA ACA CTG AAA TTA CTA GAC TAC ACT GAA AAA CCT CTT 1023 
Lys Tyr Mec Glu Thr Val Lys i«eu Leu Asp Tyr Thr Glu Lys Pro L«u 

305 310 315 

TAT GAA AAT TTA COT GAC ATT CTT TTG CAA GGA CTA AAA GCT ATA GGA 1071 
Tyr Glu Asn Leu Arg Asp He h&a Leu Gin Gly Leu Lys Ala He Gly 

320 325 330 

AGT AAG GAT GAT GGC AAA TTG GAC CTC AGT GTT GTG GAG AAT GGA G6T 1119 
Ser Lys Asp Asp Gly Lys Leu Asp Leu Ser vaX veX Glu Asn Gly Gly 

335 340 345 

no AAA GCA AAA ACA ATA ACA AAG AAG CGA AAG AAA CAA ATT GAA GAA 1167 
Leu Lys Ale Lys Thr He Thr Lys Lys Arg Lys Lys Glu He Glu Glu 

350 355 360 

AGC AAC GAA CCT G6T GTT GAA GAT ACG GAA TGC TCA AAC ACA CAG ACA 1215 
Ser Lye Glu Pro Gly Vel Glu Asp Thr Glu Trp Ser Asn Thr Gin Thr 
365 370 375 360 

GAG GAG CCC ATA CAG ACC CGT TCA AGA ACC AGA AAG AGA GTC CAfi AAG 1263 
Glu Glu Ale He Gin Thr Arg Sex Arg Thr Arg Lys Arg Val Gin Lys 
385 390 395 

50 TAATTCACAT GCTGTCAACC AGATTTCCTT TTCTTTGTTT TCTTTTQACT TTTTTCTCCT 1323 

TTTCTGTTAG AACTGTTTTA TTTTCCTGTG AGTCTTGCGA GGTGGAATTA ATGATTAAAT 1383 

55 
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25 



45 



SO 



ACTCXTGTGT TCAGAAAACA TAAACTTTTT TTATAAAAAT ATTTTCTACA ATTCATTAAA 1443 
(;G«AATTTA~TGJUtilTTTGA AAATCTTCAG GTTATACTCC TTAAOTTATC CC^^ ~1503 
GTGTTTGTGA TGTTTTGGAG TACAXATATA TGAAAATTAT TATGACACCC ACTTTTCTAA 1563 
TCATTGTACA TTTCXCAGAG TGGATAAAAA TGTTTGACAA AGTCCTCACT TTTAAGGAAA 1623 
TGCAAAGCrr AAAATAAAAC TCTCTTTTGT TTGATGCAO 1662 



15 SBQ ID NO: 2 

SSQUENCE I^BNGTB: 396 
SEQUENCE TYPE: amino acid 
TOt^OLOGYi linear 
MOI<ECUI*E TYPE: protein 
SEQUENCE DESCRIPTIOH: 

Mee Pro Arg Val Z.ys Ala Ala Gin Ala Gly Arg Gin 
15 10 

30 

Ser 5«r Ala tyo Ar? His t*eu Ala Glu Gin Phe Ala val Gly Glu lie 

35 

15 20 25 

lie Thr Asp Met Ala l.yB Lya Glu Trp Z.ya Val Gly Leu Pro lie Gly 
40 30 35 40 

Gin Gly Gly Phe Gly Cys He Tyr teu Ala Asp Met Aen Ser Ser Glu 
45 50 55 60 

Ser Val Gly Ser Asp Ala Pro Cys Val Val Lys Val Glu Pro ser Asp 

65 70 75 

Asn Gly Fro Phe Thr Gla Leu Lya Phe Tyr Gin Azrg Ala Ala Lys 
SO as SO 

55 
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Pro Glu Gin II Gin Lys Trp II Arg Thr Arg Lys Lou Lya Tyr Leu 

95 100 105 

Gly V4X Pro Lys Tyr Trp Gly 5«r Gly Leu His Asp Lys Asn Gly Lys 

110 115 120 

sex Tyr Arg Pbe Het lie Met Asp Arg Phe Gly Ser Asp Leu Gin Lys 
125 130 135 140 

15 Zle Tyr Glu Ala Aen Ala Lya Arg Phe Ser Arg Lys Thr Val Leu Gin 

145 150 155 

Leu Ser Leu Arg lie Leu Asp lie Leu Glu Tyr lie His Glu His Glu 

1€0 165 170 

Tyr Val Hia Gly Asp He Lys Ala Ser Asn Leu Leu Leu Asn Tyr Lya 

175 180 185 

Aan Pro Aap Gin Val Tyr Leu Val Asp Tyr Gly Leu Ala Tyr Arg Tyr 

190 195 200 

Cys Pro Glu Gly Val Hi. a Lya Glu Tyr i*yfl Glu Aap Pro Lye Arg Cys 
205 210 215 220 

Hia Aap Gly Thr He Glu Phe Thr Ser Xle Aep Ala Hie Aan Gly Val 
225 230 235 

40 

Ala Pro Ser Arg Arg Gly Asp Leu Glu He Leu Gly Tyr Cya Met He 
240 245 250 

45 

Gin Trp Leu Thr Gly His Leu Pro Trp Glu Asp Acn Leu Lys Asp Pro 
255 260 26S 

50 Lya Tyr Val Arg Asp Ser Lye He Arg Tyr Arg Glu Asn He Ala Ser 

270 275 280 

55 



30 



35 



16 

BNSDOCID: <EP 0960938A1 J_> 



EP 0 960 938 A1 



10 



Leu Het Asp X*ys Cys Phe Pro Glu l.ys Asn Lys Pro Gly Glu lie Ala 

285 2^0 295- — 300 

Ly» Tyr Met Glu Thr val liys I*mi jjbu Asp Tyr Thr Glu tya Pro Leu 

305 310 315 

Tyr Glu Asn Iiau Arg Asp He l^eu Leu Gin Gly Leu Lya Ala Zle Gly 
320 323 330 

15 5mz Lys Asp Asp Gly Lys Leu Asp Leu Ser val Vsl Glu Aan Gly Gly 

335 340 343 

Leu Lys Ala Lys Thr Zle Thr Lys Lys Arg Lys Lys Glu He Glu Glu 

350 355 360 

Ser Lys Glu Pro Gly Val Glu Asp Thr Glu Trp Ser Asn Thr Gin Thr 
365 370 375 380 

Glu Glu Ale He Gin Thr Arg Ser Arg Thr Arg Lye Arg Val Gin Lys 
385 390 395 

30 



20 



25 



35 



SEQ ID KO: 3 
SEQUENCE LEHGTB: 1833 
SEQUENCE TYPE: nucleic acid 
40 STRANDEDNESSSt double 

TOPOLOGY: linear 
MOLECULE TYPE: CDMA ro loRNA 
FEATURE: 

NAME/KEY: CDS 
LOCATION: 131*, 1657 
SEQUENCE DESCRIPTIONS 

£5 



45 



SO 
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10 



20 



CTC3CACTGCG AGGCCGACac AGCTCCAGAG AAGTTXGGCA GGTCCTAG<SG AGGGCAGGCT 60 
CGACTGCTGC GCCCGCCTCC CCGCGGGACT GTAGGCXCGG GGGCTCCGCC TCGTCGCAGC 120 
GGCAGAA6TG ATG CCA CCA AAA AGA AAT GAA AAA TAC AAA CTT OCT ATT 169 
Met Pro Fro Dye Arg A»n Glu Lys Tyr l-y« I»«u Pro lie 
15 10 
CCA TTT CCA GAA 06C AAG GTT CTG GAT GAT AT6 GAA OGC AAT CAO TGG 217 
15 Pro Pbtt Pro Glu Qly i«y« val Leu Asp Asp Hat Glu Gly Asn Gin Trp 

15 20 25 

GTA CTG GGC AAG AAG ATT GGC TOT GGA GGA TTT GGA TTG ATA TAX TTA 265 
Val Leu Gly Lys Lys lie Oly Ser Gly Gly Phe Gly I/eu lie Tyr l^n 
30 35 40 45 

25 Q„ ACA. AAT AAA CCA GAG AAA GAT GCA AGA CAT CTA GTA AAA 313 

Ala Phe Pro Thr A^n Lys Pro Glu Lys Asp Ala Ar^ His Val Val Lyo 

50 55 60 

CTC GAA TAT CAA GAA AAT GGC CCC TTA TTT TCA GAA CTT AAA TTT TAT 361 
Val Glu Tyr Gin Clu Asn Gly Pro Lau Pbe Ser Glu Leu Lya Phe Tyr 

65 70 75 

CAC AGA GTT GCA AAA AAA GAC TCT ATC AAA AAG TGG ATA GAA CGC AAA 409 
Gin Arg Val Ala Lys Lys Asp Cya I la Lys Lys Trp I la Glu Arg Lys 
BO es 90 

CAA CTT GAT TAT TTA GGA ATT CCT CTG TTT TAT SCA TCT G6T CTC ACT 457 
CXn Lau Asp Tyr Leu Gly lie Pro Leu Phe Tyr Gly Ser Gly Leu Thr 

95 100 105 

GAA TTC AAG GGA AGA ACT TAC AGA TTT ATG GTA ATC GAA AGA CTA GGA 505 
Glu Phe Lys Gly Arg Ser Tyr Arg Phe Met Val Met Glu Arg Lau Gly 

55 
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110 115 120 125 

ATA GAT TTA CAC AAG ATC TCA GGC CAC AAT GCT ACC TTT AAA AAC TCA 553 
II© Asp Leu Gin i*y« Xlo 5cr Gly Gin Asn Oly Thr Pike Lys Lys Ser 

130 135 140 

ACT GTC CTG CAA TTA GGT ATC CGA ATC TTO GAT GTA CTG GAA TAT ATA €01 
Thr Val lieu Cln LeU Gly lie Arg Met Xieu A«p V«l Leu Glu Tyr lie 

145 150 155 

CAT GAA AAT GAA TAT GTT CAT GGT GAT GTA AAA CCA GCA AAT CTA CTT 649 
His Glu Asn Glu Tyr val His Gly Asp Val Lys Ala Ala Asn Leu Leu 

1(0 165 170 

TTG GGT TAC AAA AAT CCA GAC CAG GTT TAT CTT GCA GAT TAT GGA CTT 697 
Leu Gly Tyr Lys Aan Pro Asp Gin Val Tyr Leu Ala Aap Tyr Gly Leu 

175 180 185 

TCC TAC AGA TAT TGT CCC AAT GGG AAC CAC AAA CAC TAT CAC CAA AAT 745 
Ser Tyr Arg Tyr Cys Pro Asn Gly Asn His Lys Gin Tyr Cln Glu Asn 
190 195 200 205 

OCT AGA AAA GGC CAT AAT GGG ACA ATA GAG TTT ACC AGC TTG GAT GCC 793 
Pro Arg Lys Gly His Asn Gly Thr lie Glu Phe Thr Ser Leu Asp Ala 

210 215 220 

CAC AAG CGA CTa CCC TTG TCC AGA CGA AGT GAC GTT GAG ATC CTC GGC 841 
His Lys Gly val Ala Leu Ser Arg Arg Ser Asp Val Glu lie Leu Gly 

225 230 235 

TAC TGC ATC CTG CGG T6G TTG TGT GGC AAA CTT CCC TGG GAA CAG AAC 889 
Tyr Cys Met Leu Arg Trp Leu eye Gly Lys Leu Pro Trp Clu Gin Asn 
240 245 250 
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CTC AAG GJWC CCT STG GCT GTG CAG ACT GCT AAA ACA AAT CTG TTG GAC 937 
X«eu Lys Asp Pro vaX Ala Val Gin Thr Ala Lys Thr Asn I.e\i Leu Xap 

255 260 265 

GAG CTC CCC CAG TCA GTG CTT AAA TOG GCT CCT TCT GGA ACC ACT TGC 985 
CXu X«eu Pro Gin Ser Val Leu I.ys Trp Ala Pro Sar Gly Ser Ser Cya 
270 275 280 285 

TGT GAA ATA GCC CAA TTT TTG GTA TGT GCT CAT ACT TTA OCA TAT CAT 1033 
Cys Glu Il« Ala Gin Phe Leu Val Cys Ala Hia Ser Leu Ala Tyr Asp 

290 29S 300 

GAA AAG CCA AAC TAT CAA GCC CTC AAG AAA ATT TTG AAC CCT CAT GGA 1061 
Glu Lya Pro Aen Tyr Gin Ala Lys Lya lie Leu Asn Pro His Gly 

305 310 315 

ATA CCT TTA GGA CCA CTG GAC TTT TCC ACA AAA GGA CAG AGT ATA AAT 1129 
lie Pro Leu Gly Pro Leu Asp Phe Ser Thr Lya Gly Gin Ser lie Asn 

32 0 325 330 

GTC CAT ACT CCA AAC AGT CAA AAA GTT GAT TCA CAA AAG GCT GCA ACA 1177 
Val His Thr Pro Asn Ser Gin Lys Val Asp Ser Gin Lye Ala Ala Thr 

335 340 345 

AAG CAA GTC AAC AAG GCA CAC AAT AGG TTA ATC GAA AAA AAA GTC CAC 1225 
l»ys Cln Val Asn Lys Ala His Aan Arg Leu lie Clu Lys Lys val Hie 
350 355 360 365 

AGT GAG AGA AOC OCT GAG TCC TCT GCA ACA TGC AAA GTG CAG AAA CAC 127 3 

Ser Glu Arg Ser Ala Glu Ser Cys Ala Thr Trp Lys val Gin Lya Glu 

370 375 3e0 

GAC AAA CTG ATT GGA TTG ATG AAC AAT GAA GCA GCT CAG GAA AGC ACA 1321 
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Glu Lys Leu lie Gly X^u Met Aen Aan Glu Ala Ala Oin Clu Ser Thr 

3B5 390 ^95- ~ - 

AGG ASA AGA CAG AAA TAT CAA GAG TCT CAA GAA CCt TTG AAT GAA GTA 1369 
Arg Arg Arg Gin l.ys Tyr Gin Glu Ser Glti Glu Pro Leu Adzi Glu Val 

400 405 410 

AAC ACT TTC CCA CAA AAA ATC AGC TAT ACA CAA TTC CCA AAC TCA TXT 1417 
75 Asn Ser Phe Pro Gin liys He Ser Tyr Thr Gin phe Pro Asn Ser Phe 

415 420 425 

TAT GAG CCT CAT CAA GAT TTT ACC ACT CCA GAT ATA TTC AAG AAG TCA 1465 
Tyr Glu Pro His Gin A«p Phe Thr Ser Pro Asp He Phe Lya Lys Ser 
430 435 440 445 

AGA TCT CCA TCT TGG TAT AAA TAC ACT TCC ACA GTC AGC ACG GGG ATC 1513 
Arg Ser Pro Ser Trp Tyr liya Tyr Thr Ser Thr VeQ Ser Thr Oly He 

450 455 460 

ACA GAC TTA GAA AGT TCA ACT GQA CTT TGG CCT ACA ATT TCC CAG TTT 1561 
Thr Asp X«eu Glu Ser Ser Thr Gly Leu Trp Pro Thr He Ser Gin Phe 

465 470 475 

ACT CTT AST GAA CAG ACA AAC CCA GAT GTT TAT TAT TAT CGC ATC ATC 1609 
Thr X«eu Ser Glu Clu Thr Asn Ale Asp Val Tyr Tyr Tyr Arg lie He 

480 4B5 490 

ATA CCT GTC CTT TTG ATG TTA GTA TTT CTT GCT TTA TTT TTT CTC 1654 
He Pro Val Leu lieu Hot Leu val Phe Leu Ala Leu Phe Phe Leu 

495 500 505 

TGAACATGAT ACCAAAATTC CTTTTGATAA TTTTTTAAGT TTCCAGCTCT TCACCGAAAT 1714 
OTTGTATTCr TATTTCACTC TTTCCTTCCA CACATTTTTA AGGTAATTCC CTTTAAAAAG 1774 
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AGAACATATT TTAACAAAGT TTGTC ACAC TCTAAAAAAT AAAATTOCTT TCTACTACT 1633 

SEQ ID NO: 4 
SEQUBNCS LENGTH! 508 
SSgUENCE TYPE: amino acid 
TOPOLOGY: linear 
HOLBCXTLE TYFEt protein 
SEQUENCE DESCRIPTION: 

M9t Pro Pro Lys Arg Aan Glu Lya Tyr Ly» Leu Pro lie 
15 10 
Pro Phe Pro Glu Gly Ly« Val Leu Aep Aep Met Glu Gly Aan Gin Trp 

15 20 25 

val Leu Gly Ly» Lya lla Gly Ser Gly Gly Phe Gly Lou Ilo Tyr L«u 
30 35 40 45 

Ala Phe Pro Tbr Aan Lys Pro Glu Lya Asp Ala Arg His Val Val Lye 

50 55 60 

Val Glu Tyr Gin Glu Asn Gly Pro Leu Phe Sar Glu Leu Lya Phe Tyr 

65 70 75 

Gin Arg Val Ala Lya Lys Asp eye He Lye Lya Trp lie Glu Arg Lys 

80 85 90 

Cln Lau Asp Tyr Leu Gly lie Pro Leu Phe Tyr Gly Ser Gly Leu Thr 

95 100 105 

Glu Phe Lya Gly Arg 5er Tyr Arg Phe Met Val Met Glu Arg Leu Gly 
110 115 120 125 

He Asp Leu Gin Lys He Ser Gly Cln Asn Cly Thr Phe Lya liys Sor 
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130 135 140 

Thr Val Ii6U Gin I.«u Gly lie Arg Het Leu Asp Val I*on Glu tyr lie 

145 ISO 15S 

His Glu ASH Glu Tyr val His Qly Asp Val Lys Ala Ala Asti l.eu Leu 

160 165 170 

Lou Gly Tyr Lya Aav Pro Aap Gin Val Tyr Leu Ala Asp Tyr Gly liau 

175 180 185 

Sar Tyr Arg Tyr Cys Pro Asn Gly Aan His Lys GLn Tyr Gin Glu Aan 
190 195 200 205 

Pro Arg Lys Gly His Asn Gly Thr Il« Glu Phe Thr Ser I<eu Asp Ala 

210 215 220 

His Lys Gly Val Ala Leu Ser Arg Arg Sar Asp Val Glu lie Leu Gly 

22S 230 235 

Tyr Cys Met Leu Arg Trp Leu Cys Gly Lys Leu Pro Trp Glu Gin Asn 

240 245 250 

Leu Lys Asp Pro val Ala Val Gin Thr Ala Lys Thr Asn Leu Leu Asp 

255 260 265 

Glu Leu Pro Gin Ser Val Leu Lys Trp Ala Pro Ser Gly Ser Ser Cys 
270 275 290 285 

Cys Glu Zle Ala Gin Phe Leu Val Cys Ala His Ser Leu Ala Tyr Aap 

290 295 300 

Glu Lys Pro Asn Tyr Gin Ala Leu Lys Lys lie Leu Asn Pro His Gly 

305 310 315 

lie Pro Leu Gly Pro Leu Asp Phe Ser Thr Lys Gly Gin Ser He Aan 
320 325 330 
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V«l His Thr Pro Aan Ser Gin I.y8 Val Asp Ser Gin Lys Ala Ala Thr 

335 340 345 

Lys Gin Val Aan Lys Ala fiis Aan Arg Leu lie Glu Lys Lys Val His 
350 355 360 365 

Ser Glu Arg Ser Ala Glu Ser Cys Ala Thr Trp Lys Val Gin Lys Glu 

370 375 380 

Glu Lys Leu He Gly Leu Mel; Asn Aan Glu Ala Ala Gin Glu Ser Thr 

385 390 395 

Arg Arg Arg Gin Lya Tyr Gin Glu Ser Gin Glu Pro Leu Asn Glu Val 

400 405 410 

AsD ser Phm Pro Gin Lya lie Ser Tyr Thr Gin ?he Pro Aan Ser Phe 

415 420 425 

Tyr Glu Pro His Gin Asp Phe Thr Ser Pro Asp lie Phe Lys Lys Ser 
430 435 440 445 

Arg Ser Pro Sar Trp Tyr Lya Tyr Thr Ser Thr Val Ser Thr Gly lie 

450 455 460 

Thr Asp Leu Glu Ser Ser Thr Gly Leu Trp Pro Thr lie ser Gin Phe 
465 470 475 

40 Thr Leu Ser Glu Glu Thr Asn Ale Asp Val Tyr Tyr Tyr Arg lie He 

480 465 490 

lie Pro Val Leu Leu Met Leu Val Phe Leu Ala Leu Phe Phe Leu 
495 500 505 
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SKQ ID WO: 5 
SBQUEHCE LENGTH: 22 
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SEQUENCE TYPE* nucleic acid 
STEIAHDEDNESSS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, synthetic 0£BV 
SB^^JEHCE DESCRIPTION: 
CTJAT^COAC TC&CTATAGG GC 

SEQ ID NO I 6 
SEQUENCE LENGTH: 21 
SEQUENCE TYPE: nucleic acid 
STRANDEDMESSS: single 
TOPOLOGY: llnear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
TGTAGCGTGIk AGACGACAGA A 

SEQ ID DOS 7 
SEQUENCE LENGTH: 21 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS5: dingle 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, synthetic DMA 
SEQUENCE DESCRIPTION: 
TCGAGCGCCC 6CCCGGGCAGG T 
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SEQ ID KO: 8 
SEQUENCE LENGTH: 21 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSSs single 
rOPOLOGTt linear 

MOLECULE TTPS9 other nucleic acid, synthetic UNA 
SEQUENCE DESCRIPTION: 
AGOGCCTGOT GCGGAGGGC6G T 

SEQ ID NO I 9 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSSs tingle 
TOPOLOGY s linear 

MOLECULE TYPE* other nucleic acid, synthetic ONA 
SEQUENCE DESCRIPTZONt 
CCAGGCTTTT CCCACTCACG AC 

SEQ ID NOi 10 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDBDNESSSi single 
TOPOLOGY: linear 

MOLECULE TYPES other nucleic acid, synthetic oNA 
SEQUENCE DESCRIPTION: 
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SEQ 10 KO: 11 
SBQUSMCB LEHGTH: 24 
SEQUEHCE TYPE: nucleic acid 
SntXNDEDKESSSs single 
TOPOLOGY s linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE OfiSCRIPTION: 
TGTAGTTCAG AAGMOATTT CAGG 



SEQ ID NO: 12 
SEQUENCE LENGTH} 24 
30 SEQUENCE TYPE: nucleic acid 

STIUkNDEONESSS s single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION t 
ATAJLTGGATC GCTTTGGGAC TGAC 



SEQ ID NOs 13 
SEQUENCE LENGTH: 26 
SEQUENCE TYPE I nucleic acid 
STRANDBDNES5S: single 
TOPOLOGY: linear 
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MOLECULE TYP£; thar nucl ic acid, 9ynt:hetic DMA 
5EQUEHCS DESCRXPTlONr 
TGAAGGTCQG AGTCAACGGA TTTGGT 

SSQ ID NO: 14 
SEQUENCE LENGTHS 24 
SSQUENCS TYPE: nuclelc acld 
STRANDEDKESSS : Single 
TOPOLOGY; linear 

MOLECULE TYPE: otihar nucleic acid, synt:hetic DNA 
SEQUENCE DESCKXPTIOK: 
CATGTGTCCC ATGAGCTCCA CCAC 

6EQ ID NOs 15 
SEQUENCE LENGTH: 27 
SEQUENCE TYPEt nucleic acid 
STRANDEDKESSS: Single 
TOPOLOGY: linear 

MOLECULE TYPEr o^her nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CCATCCTAAT ACGACTCACT ATAOGGC 

SSQ ID NOi 16 
SEQUENCE LENGTH: 24 
SEQUBNCE TYPE: nucleic acid 
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STRAHDEDNESSS: Single 
TOPOLOGY: linear 

HOLECXJLE TYPE: Other nuclsic acid, syathetic DKA 
SEQUENCE DESCRIPTION: 
GGATTTTCCT GATACTGTTT GTGG 

SEQ ID NOc 17 
SSQUENCS UNGtH< 24 
SBQasNCE TYPE: nucleic acid 
STHANDEDNESSS: single 
TOPOXiOGYs linear 

MOIiECUIiB TYPE: Other nucleic acid, synthetic DNA 
SEQOENCB DESCRIPTIONS 
ACCACAAACA GTAXCAGCAA AATC 

SSQ ID NO: 18 
SEQUENCE LENGTH: 24 
SEQUENCE TYPE: nucleic acid 
STRAHDEDNESSS: single 
TOPOLOGY « linear 

MOLECULE TYPE; Other nucleic acidr synthetic DNA 
SEQUENCE DESCRIPTION: 
ACCTTTAAAA AGTCAACTGT CCTC 

SSQ XD NO: 19 
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SEQUENCE LENGTH: 24 
SEQUENCE TYPES nucleic acid 
5TRANDEDNESSS: single 
TOPOLOGY t linear 

MOLECULE TYPE: other nucleic acid, synthetiic DMA 
SEQUENCE DESCRIPTION: 
AAAMTTATC AMATOAATT TTGG 

SEQ ZD MO I 20 
SEQUENCE LENGTH: 2S 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DMA 
SEQUENCE DESCRIPTION: 
TTACTCTTAG TGAAGACACA AACCC 

SEQ ID NO: 21 
SEQUENCE LENGTH: 3S 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSS : single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION* 

AGCTCSCGCCC GCGGTCTGCe GCTTAGCTGA AAAT^C 
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S£Q ID KO: 22 
5EQUCKCE LENGTH: 36 
SEQUENCE TYPE: nucl€ic acid 
STMSDZDHESSSs single 
TOPOZfOGT r linear 

MOLECULE TYPE I other nucleic acld^ aynthetic DNA 
SEQUENCE DESCRIPTION: 

AGCTOCOGCC CCAAJkACAAX GAAAAGGAAA TCTGOT 

SBQ ID NO: 23 
SEQUENCE LENGTH: 37 
SEQUENCE TYPEe nucleic acid 
STBANDEDNSSSS: single 
TOPOLOGTs linear 

MOLECULE TYPE: other nucleic acid/ synthetic DNA 
SEQUENCE DSSCRIPTION3 

AGCTQCGGCC GCAAfitCAXG CCACCAXAAA GAAATGA 

SEQ ID NO: 24 
SEQUENCE LENGTH: 36 
SEQUENCE TYPE: nucleic acid 
STHANDSOHESSS: Single 
TOPOLOGY: linear 

MOLECULE TYPE I Other nucleic acid, synthetic DNA 
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S£QU£NC£ DCS CRXPTI OK I 

AGCTGCGGCC GCTGGAAG6A AACACT6AAA TAAGAA 

5EQ ID NO I 25 I 
SEQUENCE LENGTHS 10 
SEQUSKCB TYPEr mino acld 
TOPOItOGY: linear 
MOIiECULE TYPE I protein 
SSQUEKCE DESCRIPTION « 

OIu Gin Ly» Leu lie Ser Glu Glu Aap Leu 
15 10 

5SQ ID NOs 26 
SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
STRANDEDNES5S: single 
TOPOLOGY: linear 

MOLECULE TYPEi Other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 

6XTGGATCCC 6TCT6CGGCT TAGGTGXAIUl TGC 

SEQ ID NO: 26 
SEQUENCE LENGTHS 60 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSS: single 
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TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid^ synthetic DNA 
SEQUENCE DESCRIPTION t 

GATGGATCCT TAGAGGTCTT CTTCTGAGAT GAGCTTCTGC TCCTTCTGCA CTCTCTTTCT 60 

SBQ ID MO: 28 
SEQUENCE LENGTHS 33 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSS : single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTIONS 

GATGGATCCA GTCATGCCAC CAAAAAGAAA TGA 33 

SEQ ID NO: 29 
SEQUENCE LENGTH: 60 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 

GATGGATCCT TAGAGGTCTT CTTCTGAGAT GAGCTTCTGC TCGAGAAAAA ATAAAGCAAG 60 

SEQ ID NO: 30 
SEQUENCE LENGTH: 31 
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SEQUENCE TYPE: nucleic acid 

5 

STRANDEDMESSSs single 
TOPOLOGY: linear 
10 MOI*ECULB TYPE: other nucleic acid, synthetic OKA 

SEQUENCE DESCRIPTION: 

GATGGA!rCCC CATGCCTCGT GTAAAAGCAG C 31 

15 

SEQ ID NO: 31 
20 SEQUENCE I^GTH: 31 

SEQUENCE TYPE: nucleic acid 
STRANDBDNS5SS: single 
TOPOLOGY: linear 

MOLECUI^ TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTIONS 
GATCGATCCC CCJUUIGAAAA GGAAATCTGG T 31 

35 

SEQ ID NO s 32 
SEQUENCE LENGTH: 31 

40 

SEQUENCE TYPE: nucleic acid 
5TRAN0EDNESSS: single 
45 TOPOLOGY: linear 

MOLECULE TYPE! other nucleic acid, synthetic DMA 
SEQUENCE DESCRIPTION: 

GATCGATCCC CATGCCACCA AAAAGAAATG A 31 
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SEQ ID NOt 33 
SEQUENCE LENGTH: 31 
SEQUENCE TYPES nucleic acid 
STRANDEDNESSS : single 
TOPOLOGY s linear 

MOLECULE TTPEs Other nucleic acid« synthetic DNA 

SE(?7ENCE DESCRIPTION: 

GATGGATCCC CACAACATTT CGGTGAAGAG C 

SEQ ID NO: 34 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSS: Single 
TOPOLOGY s linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CCTTGTGTTG TATOGGTGGA ACCCA6TGA 

SEQ ID NO: 35 
SEQUENCE LENGTH: 31 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 

SEQUENCE DESCRIPTION X 
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ACACCACGAT GCCTGGAGCA ATGGCAACAA C 31 

SEQ ID NO: 36: 
SBQUBNCE LENGTH: 25 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: 

Mat Lau Pro Glu Ser Glu Asp GXu Glu Ser Tyr Asp Thr Glu Ser Glu Pbe Thr 

15 10 15 

Glu Phe Thr Glu Asp Glu Leu 
20 25 



SBQ ID NO: 37: 
SEQUENCE LENGTH: 19 
SEQUENCE TYPE: amino acid 

35 

TOPOLOGY: linear 
MOLECULE TYPE: protein 

40 

SEQUENCE DESCRIPTION: 

Cya Gin Thr Glu Glu Ala He Gin Thr Arg Ser Arg Thr Arg Lya Arg Val Gin 
45 1 5 10 15 

Lya 

50 

Claims 

1. A protein having the amino acid sequtnce of SEQ ID NO: 2. or a protein having the same amino acid sequence 
55 where one or more amino acids are added, deleted, or substituted and having serine-threonine kinase activity. 

2. A protein having the amino acid sequence of SEQ ID NO: 4. or a protein having the same amino acid sequence 
where one or more amino acids are added, deleted, or substituted and having serine-threonine kinase activity. 
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3. A protein encoded by a DNA sequence that hybridizes with the DNA sequence of SEQ ID NO: 1 or its coniplemen- 
tary sequence and having serine-threonine kinase activity. 

4. A protein encoded by a DNA sequence that hybi-idizes with the DNA sequence of SEQ 
tary sequence and having serine-threonine kinase activity, 

5. A DNA encoding the protein of any one of claims 1 to 4. 

6. A vector comprising the DNA of claim 5. 

7. A transformant carrying the vector of claim 6. 

8. A method of producing the protein of any one of claims 1 to 4, wherein the method comprises cultivating the trans- 
formant of claim 7. 

9. An antibody binding to the protein of any one of claims 1 to 4. 

10. An antisense DNA against the DNA of claim 5 or part of it 

20 1 1 . A method of screening compounds having inhibitory activity of serine-threonine kinase activity of the protein of any 
one of claims 1 to 4. wherein the method comprises 

(a) contacting the protein of any one of claims 1 to 4 with a substrate to be phosphorylated by this protein in 
the presence of a test compound to detect the kinase activity of the protein of any one of claims 1 to 4. and 
25 (b) comparing the kinase activity detected in step (a) with that detected in the absence of the test compound 

and selecting a compound that lowers the kinase activity of the protein of any one of claims 1 to 4. 
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Fig. 1 



Adapt:er 1 5* CIAATACGACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAGGT 3' 

3* G6CCCGTCCA S' 

Adapter 2 5' TGTAGC0TCAA6ACGACAGAAA0GGCGTCGTGC0GAG6GCGGT 3' 

3' 6CCTCCCGCCA 5* 
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rig- 2 

A . 

Consensus sequence of the act.xve si'te of 
serine-threonine Kinase 

[Leu, Ile.Val ,Het, Ph€,Tyr,Cys]-Xaa-[His.Tyr]-Xaa-Asp-tLeu, lie, Val . 
Het,Phe,Tyr3-Lys-Xaa-Xaa-Asn-[Leu,ne, Val ,Met,Phe,Tyr,Cys]"lLeu, 
Ilc.Val,HBt.Phe,Tyr.Cyd]-[Leu.ne.Val,Het|Phe.Tyr.Cys] 

B . 

Consensus sequence of the ATP binding site of protein kinase 

[Leu,ne,Val]-Gly-Xaa-Gl y-Xaa-[Phe.Tyr,Trp,Hct.Gly.Ser,Thr,Asn.His] 
-[Ser.Gly,Ala]-Xaa-(Leu, He, Val ,Cys,Ala,Thr]-Xaa-Xaa-tOly, Ser ,Thr, 
Ala,Cy6,Leu, lie, Val ,Mat, Pho,Tyr]-Xaa(6 times or 18 timed)-(Leu, He, 
Val .Met,Phe,Tyr,Trp,Cys,Ser.Thr,Ala,Arg]-[ATa, lie, Val ,Pro]-[Leu, lie, 
Val ,Met , Phe, A1 a , G1 y , Cy 8, Lys, ArgJ-Lys 
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Fig. 3 
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Fig • 4 
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Fig . 5 
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Fig. 6 
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Fig • 7 
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Fig. 8 
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Fig. 9 
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Fig. 10 
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Fig. 12 
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Fig. 13 
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